ENHANCED OUTPUT-BASED PERCEPTUAL MEASURE FOR PREDICTING SUBJECTIVE QUALITY OF SPEECH (ThuAmPO4)
Abstract
This paper presents an enhanced version of a non-intrusive measure for assessing the speech quality of voice communication systems and evaluates its performance. The new measure, which uses only the output of the system, is based on measuring perception-based objective auditory distances between the voiced parts of the output (processed) speech whose quality is to be evaluated and appropriately matching references extracted from one of four pre-formulated codebooks, with the codebook chosen according to the estimated pitch values. The codebooks are formed by optimally clustering a large number of parametric speech vectors extracted from a database of clean speech records. The measured auditory distances are then mapped into equivalent subjective Mean Opinion Scores (MOS). The required clustering and matching is carried out with an efficient data-mining tool known as the Self-Organizing Map (SOM). Short-time Bark spectrum analysis is used to obtain a perception-based, speaker-independent parametric representation of the speech. Reported evaluation results show that the proposed enhanced speech quality assessment method provides quality scores that are highly correlated with MOS obtained by formal subjective listening tests.
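The matching stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the pitch band edges, the linear distance-to-MOS coefficients, the Euclidean distance, and all function names are assumptions made for illustration, and the codebooks are taken as already trained. Finding the closest codebook vector is used here as a stand-in for locating the best-matching unit of a trained SOM, and the feature vectors stand in for short-time Bark spectra.

```python
import numpy as np

# Hypothetical pitch band edges (Hz) that split voiced frames across four codebooks.
PITCH_EDGES_HZ = np.array([120.0, 170.0, 220.0])


def select_codebook(pitch_hz):
    """Return the index (0..3) of the codebook assigned to an estimated pitch."""
    return int(np.searchsorted(PITCH_EDGES_HZ, pitch_hz, side="right"))


def nearest_distance(frame, codebook):
    """Euclidean distance from one feature vector to its closest codebook entry."""
    return float(np.min(np.linalg.norm(codebook - frame, axis=1)))


def mean_auditory_distance(frames, pitches_hz, codebooks):
    """Average the per-frame distance to the best-matching reference vector."""
    distances = [
        nearest_distance(frame, codebooks[select_codebook(pitch)])
        for frame, pitch in zip(frames, pitches_hz)
    ]
    return float(np.mean(distances))


def distance_to_mos(distance, slope=-1.0, intercept=4.5):
    """Map a distance to a MOS-like score; the linear fit is a placeholder."""
    return float(np.clip(intercept + slope * distance, 1.0, 5.0))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Random placeholder codebooks: 4 codebooks of 16 vectors with 18 bands each.
    codebooks = [rng.random((16, 18)) for _ in range(4)]
    # Three slightly perturbed reference vectors act as "processed" voiced frames.
    frames = codebooks[1][:3] + 0.01
    pitches = [150.0, 150.0, 150.0]
    distance = mean_auditory_distance(frames, pitches, codebooks)
    print(distance_to_mos(distance))
```

In practice the mapping from distance to MOS would be fitted against scores from subjective listening tests rather than fixed by hand as it is here.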
Similar resources
Output-Based Objective Measure for Non-Intrusive Speech Quality Evaluation
This paper describes a newly developed output-based method for non-intrusive evaluation of speech quality of voice communication systems, and evaluates its performance. The method, which uses only the output of the system, is based on measuring perceptually motivated objective auditory distances between the voiced parts of the speech signal whose quality to be evaluated to appropriately matchin...
Non-intrusive assessment of perceptual speech quality using a self-organising map
A new output-based method for non-intrusive assessment of speech quality for voice communication system is proposed and its performance evaluated. The method is based on comparing the output speech to an appropriate reference representing the closest match from a pre-formulated codebook containing optimally clustered speech parameter vectors extracted from a large number of various undistorted ...
Evaluation of objective measures for speech enhancement
In this paper, we evaluate the performance of several objective measures in terms of predicting the quality of noisy speech enhanced by noise suppression algorithms. The objective measures considered a wide range of distortions introduced by four types of real-world noise at two SNRs by four classes of speech enhancement algorithms: spectral subtractive, subspace, statistical-model based and Wi...
Psychoacoustic filtering for noisy speech enhancement
A new denoising approach is introduced in this paper. It is based on the fact that denoising may be performed by mimicking the human ear function in order to improve the psychoacoustics appearance of speech signal. In the proposed method, the speech signal is decomposed by using a gammatone filterbank in accordance with ERB scale. Spectral attenuation filtering is then applied in each sub-band ...
The Investigation of Frame Disturbance (FD) in Perceptual Evaluation Speech Quality (PESQ) as a Perceptual Metric
Satisfying customers’ needs economically is one of the important aspects of the mobile communication industry. Providers should deliver a good and consistent quality of service, as expected by the customers. Hence, it amounts to controlling the speech quality perceived by the customers. However, to control the speech quality, the reliable measurement of the speech quality must be determined first, t...